Ballymena
Reasoning-Table: Exploring Reinforcement Learning for Table Reasoning
Lei, Fangyu, Meng, Jinxiang, Huang, Yiming, Chen, Tinghong, Zhang, Yun, He, Shizhu, Zhao, Jun, Liu, Kang
Table reasoning, encompassing tasks such as table question answering, fact verification, and text-to-SQL, requires precise understanding of structured tabular data, coupled with numerical computation and code manipulation for effective inference. Supervised fine-tuning (SFT) approaches have achieved notable success but often struggle with generalization and robustness due to biases inherent in imitative learning. We introduce Reasoning-Table, the first application of reinforcement learning (RL) to table reasoning, achieving state-of-the-art performance. Through rigorous data preprocessing, reward design, and tailored training strategies, our method leverages simple rule-based outcome rewards to outperform SFT across multiple benchmarks. Unified training across diverse tasks enables Reasoning-Table to emerge as a robust table reasoning large language model, surpassing larger proprietary models like Claude-3.7-Sonnet by 4.0% on table reasoning benchmarks. The approach also achieves excellent performance on text-to-SQL tasks, reaching 68.3% performance on the BIRD dev dataset with a 7B model. Further experiments demonstrate that Reasoning-Table enhances the model's generalization capabilities and robustness.
- South America > Bolivia (0.14)
- South America > Uruguay (0.14)
- South America > Brazil (0.14)
- (38 more...)
Can We Catch the Elephant? A Survey of the Evolvement of Hallucination Evaluation on Natural Language Generation
Qi, Siya, He, Yulan, Yuan, Zheng
Hallucination in Natural Language Generation (NLG) is like the elephant in the room, obvious but often overlooked until recent achievements significantly improved the fluency and grammaticality of generated text. As the capabilities of text generation models have improved, researchers have begun to pay more attention to the phenomenon of hallucination. Despite significant progress in this field in recent years, the evaluation system for hallucination is complex and diverse, lacking clear organization. We are the first to comprehensively survey how various evaluation methods have evolved with the development of text generation models from three dimensions, including hallucinated fact granularity, evaluator design principles, and assessment facets. This survey aims to help researchers identify current limitations in hallucination evaluation and highlight future research directions.
- Asia > Singapore (0.05)
- North America > Canada > Ontario > Toronto (0.04)
- Europe > Ireland > Leinster > County Dublin > Dublin (0.04)
- (15 more...)
- Research Report (1.00)
- Overview (1.00)
ZeroGen: Zero-shot Multimodal Controllable Text Generation with Multiple Oracles
Tu, Haoqin, Yang, Bowen, Zhao, Xianfeng
Automatically generating textual content with desired attributes is an ambitious task that people have pursued long. Existing works have made a series of progress in incorporating unimodal controls into language models (LMs), whereas how to generate controllable sentences with multimodal signals and high efficiency remains an open question. To tackle the puzzle, we propose a new paradigm of zero-shot controllable text generation with multimodal signals (\textsc{ZeroGen}). Specifically, \textsc{ZeroGen} leverages controls of text and image successively from token-level to sentence-level and maps them into a unified probability space at decoding, which customizes the LM outputs by weighted addition without extra training. To achieve better inter-modal trade-offs, we further introduce an effective dynamic weighting mechanism to regulate all control weights. Moreover, we conduct substantial experiments to probe the relationship of being in-depth or in-width between signals from distinct modalities. Encouraging empirical results on three downstream tasks show that \textsc{ZeroGen} not only outperforms its counterparts on captioning tasks by a large margin but also shows great potential in multimodal news generation with a higher degree of control. Our code will be released at https://github.com/ImKeTT/ZeroGen.
- North America > United States > New York (0.04)
- North America > Canada > Ontario > Middlesex County > London (0.04)
- North America > United States > District of Columbia > Washington (0.04)
- (6 more...)
- Leisure & Entertainment (1.00)
- Law > Criminal Law (1.00)
- Government > Regional Government (1.00)
- (4 more...)
New Year Honours 2018: Barry Gibb, Ringo Starr and Darcey Bussell head list
Bee Gees singer Barry Gibb and Beatles drummer Ringo Starr have been knighted, and Strictly judge Darcey Bussell made a dame, in the New Year Honours. Ex-Deputy PM Nick Clegg and War Horse novelist Michael Morpurgo also receive knighthoods, and author Jilly Cooper and TV chef Rick Stein become CBEs. Among five honours for the World Cup-winning England Women cricket team is an OBE for captain Heather Knight. Ex-astronaut Helen Sharman joins the Order of St Michael and St George. Alexandra Shulman, who recently stood down as editor of British Vogue after 25 years; actors Hugh Laurie and Susan Hampshire, and leading artificial intelligence researcher Demis Hassabis are made CBEs.
- North America > United States > California > Los Angeles County > Los Angeles (0.05)
- Europe > United Kingdom > Scotland (0.05)
- Europe > United Kingdom > Northern Ireland > County Antrim > Ballymena (0.05)
- (4 more...)
- Media > Music (0.86)
- Leisure & Entertainment > Sports > Cricket (0.35)